Affective classification of generic audio clips using regression models

نویسندگان

Nikos Malandrakis

Shiva Sundaram

Alexandros Potamianos

چکیده

We investigate acoustic modeling, feature extraction and feature selection for the problem of affective content recognition of generic, non-speech, non-music sounds. We annotate and analyze a database of generic sounds containing a subset of the BBC sound effects library. We use regression models, longterm features and wrapper-based feature selection to model affect in the continuous 3-D (arousal, valence, dominance) emotional space. The frame-level features for modeling are extracted from each audio clip and combined with functionals to estimate long term temporal patterns over the duration of the clip. Experimental results show that the regression models provide similar categorical performance as the more popular Gaussian Mixture Models. They are also capable of predicting accurate affective ratings on continuous scales, achieving 62-67% 3-class accuracy and 0.69-0.75 correlation with human ratings, higher than comparable numbers in literature.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Understanding Affective Content of Music Videos through Learned Representations

In consideration of the ever-growing available multimedia data, annotating multimedia content automatically with feeling(s) expected to arise in users is a challenging problem. In order to solve this problem, the emerging research field of video affective analysis aims at exploiting human emotions. In this field where no dominant feature representation has emerged yet, choosing discriminative f...

متن کامل

Predicting Affect in Music Using Regression Methods on Low Level Features

Music has been shown to impact the affective states of the listener. The emotion in music task at the MediaEval challenge 2015 focuses on predicting the affective dimensions of valence and arousal in music using low level features. In particular, this edition of the challenge involves prediction on full length songs given a training set containing smaller 30 second clips. We approach the proble...

متن کامل

بازشناسی خودکار حالت عاطفی مبتنی بر تغییرات فیزیولوژیک

Recently, automatic affective state recognition has been noteworthy for improving Human Computer Interaction (HCI), clinical researches and other various applications. Little attention has been paid so far to physiological signals for affective state recognition compared to audio-visual methods. Different affective states stimulate the Autonomic Nervous System (ANS) and lead to changes in physi...

متن کامل

Audio-Video based Classification using SVM and AANN

This paper presents a method to classify audio-video data into one of five classes: advertisement, cartoon, news, movie and songs. Automatic audio-video classification is very useful to audio-video indexing, content based audio-video retrieval. Mel frequency cepstral coefficients are used to characterize the audio data. The color histogram features extracted from the images in the video clips a...

متن کامل

Content-based audio classification and retrieval using a fuzzy logic system: towards multimedia search engines

In recent years, available audio corpora are rapidly increasing from fast growing Internet and digital libraries. How to classify and retrieve sound files relevant to the user’s interest from large databases is crucial for building multimedia web search engines. In this paper, content-based technology has been applied to classify and retrieve audio clips using a fuzzy logic system, which is int...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

Affective classification of generic audio clips using regression models

نویسندگان

چکیده

منابع مشابه

Understanding Affective Content of Music Videos through Learned Representations

Predicting Affect in Music Using Regression Methods on Low Level Features

بازشناسی خودکار حالت عاطفی مبتنی بر تغییرات فیزیولوژیک

Audio-Video based Classification using SVM and AANN

Content-based audio classification and retrieval using a fuzzy logic system: towards multimedia search engines

عنوان ژورنال:

اشتراک گذاری